Towards a High-Level Audio Framework for Video Retrieval Combining Conceptual Descriptions and Fully-Automated Processes
نویسندگان
چکیده
The growing need for 'intelligent' video retrieval systems leads to new architectures combining multiple characterizations of the video content that rely on highly expressive frameworks while providing fully-automated indexing and retrieval processes. As a matter of fact, addressing the problem of combining modalities within expressive frameworks for video indexing and retrieval is of huge importance and the only solution for achieving significant retrieval performance. This paper presents a multi-facetted conceptual framework integrating multiple characterizations of the audio content for automatic video retrieval. It relies on an expressive representation formalism handling high-level audio descriptions of a video document and a full-text query framework in an attempt to operate video indexing and retrieval on audio features beyond state-of-the-art architectures operating on low-level features and keyword-annotation frameworks. Experiments on the multimedia topic search task of the TRECVID 2004 evaluation campaign validate our proposal.
منابع مشابه
A Framework of Indexation and Document Video Retrieval based of the Conceptual Graphs
Most of the video indexing and retrieval systems suffer from the lack of a comprehensive video model capturing the image semantic richness, the conveyed signal information and the spatial relations between visual entities. To remedy such shortcomings, we present in this paper a video model integrating visual semantics, spatial and signal haracterizations. It relies on an expressive representati...
متن کاملA Framework Integrating Signal / Semantic Visual Characterizations for Conceptual Video Retrieval
Most of the video indexing and retrieval systems suffer from the lack of a comprehensive video model capturing the image semantic richness, the conveyed signal information and the spatial relations between visual entities. To remedy such shortcomings, we present in this paper a video model integrating visual semantics, spatial and signal characterizations. It relies on an expressive representat...
متن کاملMultimedia surrogates for video gisting: Toward combining spoken words and imagery
Good surrogates that allow people to quickly derive the gist of videos without taking the time to view the full video are crucial to video retrieval and browsing systems. Although there are many kinds of textual and visual surrogates used in video retrieval systems, there are few audio surrogates in practice. To evaluate the effectiveness of audio surrogates alone and in combination with one ki...
متن کاملCombining Collaborative Tagging and Ontologies in Image Retrieval Systems
In this paper we propose a combination between collaborative tagging and semantic web technologies for the development of an image repository system. The proposed system will be part of the WESONET project which will also handle other types of resources like video and audio. Our approach combines low level features extracted by automatic means, high level descriptions provided by the content cr...
متن کاملIndexing an intelligent video database using evolutionary control
In this paper we present the implementation of an intelligent video database using evolutionary control. By using automatic video indexing techniques, the retrieval of video segments can be performed using free natural language queries. Retrieval of video segments from a database for editing and viewing is becoming an important topic in video processing. A cinematic movie consists of video segm...
متن کامل